English Event Detection With Translated Language Features
Abstract
We propose novel radical features from automatic translation for event extraction. Event detection is a complex language processing task for which it is expensive to collect training data, making generalisation challenging. We derive meaningful subword features from automatic translations into a target language. Results suggest this method is particularly useful when using languages with writing systems that facilitate easy decomposition into subword features, e.g., logograms and Cangjie. The best result combines logogram features from Chinese and Japanese with syllable features from Korean, providing an additional 3.0 points of F-score when added to state-of-the-art generalisation features on the TAC KBP 2015 Event Nugget task.
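As a rough illustration of the idea in the abstract, the sketch below turns an already-translated token into per-character subword features: one feature per Han logogram (Chinese, Japanese) and one per Hangul syllable block (Korean). It is not the authors' implementation. The function names, the feature string format, and the example translations are assumptions made for illustration, and the translation step itself is assumed to happen elsewhere. Cangjie decomposition would need an external lookup table and is left out.

```python
def script_of(ch: str) -> str:
    """Return a coarse script label for one character, using Unicode ranges."""
    cp = ord(ch)
    if 0x4E00 <= cp <= 0x9FFF:
        return "han"      # CJK Unified Ideographs (basic block only)
    if 0xAC00 <= cp <= 0xD7A3:
        return "hangul"   # precomposed Hangul syllable blocks
    return "other"        # kana, Latin, punctuation, etc. are ignored here


def subword_features(translation: str, lang: str) -> list[str]:
    """Emit one feature per logogram or Hangul syllable in a translated token.

    `translation` is assumed to be machine-translation output for an English
    trigger word; `lang` is a hypothetical language tag used only as a prefix.
    """
    feats = []
    for ch in translation:
        script = script_of(ch)
        if script != "other":
            feats.append(f"{lang}:{script}:{ch}")
    return feats


if __name__ == "__main__":
    # Hand-written stand-ins for translations of the English word "attack".
    for lang, text in [("zh", "攻击"), ("ja", "攻撃"), ("ko", "공격")]:
        print(lang, subword_features(text, lang))
```

In this sketch the Chinese and Japanese examples share the feature character 攻 under different language prefixes. Whether features should be shared across languages or kept language-specific is a design choice the abstract does not settle.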
Similar resources
Rhetorical Variation in Medical Article Abstracts Written in English and Persian
The present study aimed at finding structural variations in the translated abstracts from Persian into English and comparing them with abstracts originally written in English and published in international journals in the area of medical sciences. To do this, 64 medical article abstracts (32 in Iranian journals, 32 in international journals) were analyzed and compared on the basis of Swales’ mo...
DRAMA TRANSLATION from PAGE to STAGE
This study is the result of an attempt to investigate the differences between the Persian translated drama text (page) of each English drama text with its performance on the stage (stage) in Iran. In other words, the present researchers tried to find the implemented changes in page which make it real on the stage in the target language and culture in order to show that in drama translating and ...
English-Persian Plagiarism Detection based on a Semantic Approach
Plagiarism which is defined as “the wrongful appropriation of other writers’ or authors’ works and ideas without citing or informing them” poses a major challenge to knowledge spread publication. Plagiarism has been placed in four categories of direct, paraphrasing (rewriting), translation, and combinatory. This paper addresses translational plagiarism which is sometimes referred to as cross-li...
A Language-Independent Neural Network for Event Detection
Event detection remains a challenge due to the difficulty of encoding the word semantics in various contexts. Previous approaches heavily depend on language-specific knowledge and pre-existing natural language processing (NLP) tools. However, compared to English, not all languages have such resources and tools available. A more promising approach is to automatically learn effective features from...
A Stylistic Approach to Translation: Figurative language devices in the Persian renderings of Alcott's Little Women
The present study aimed firstly at investigating the impact of translators' style on figurative language translation from English into Persian. Secondly, it intended to find which strategies were most frequently adopted by Persian translators to translate figures of speech into Persian. Lastly, the study sought to check the extent of transference of figurative features of literary texts in Engl...